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Abstract 

The observation of foreshocks preceding large earthquakes and the suggestion that foreshocks 
have specific properties that may be used to distinguish them from other earthquakes have raised 
the hope that large earthquakes may be predictable. Among proposed anomalous properties are 
the larger proportion than normal of large versus small foreshocks, the power law acceleration of 
seismicity rate as a function of time to the mainshock and the spatial migration of foreshocks 
toward the mainshock, when averaging over many sequences. Using Southern California 
seismicity, we show that these properties and others arise naturally from the simple model that 
any earthquake may trigger other earthquakes, without arbitrary distinction between foreshocks, 
aftershocks and mainshocks. We find that foreshocks precursory properties are independent of 
the mainshock size. This implies that earthquakes (large or small) are predictable to the same 
degree as seismicity rate is predictable from past seismicity by taking into account cascades of 
triggering. The cascades of triggering give rise naturally to long-range and long-time 
interactions, which can explain the observations of correlations in seismicity over surprisingly 
large length scales. 
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1. Foreshocks, mainshocks and 
aftershocks: hypothesis and 
predictions 

It has been recognized for a long time that large 
earthquakes are sometimes preceded by an acceler- 
ation of the seismic activity, known as foreshocks 
[Jones and Molnar, 1976; Abercrombie and Mori, 
1996]. In addition to the increase of the seismicity 
rate a few hours to months before large earthquakes, 
other properties of foreshocks have been reported, 
which suggest their usefulness (when present) as pre- 
cursory patterns for earthquake prediction. Special 
physical mechanisms have been proposed for fore- 
shocks with the hope of helping earthquake prediction 
hope of helping earthquake prediction [Yamashita 
and Knopoff, 1989; SorneMe et ai, 1992; Dodge et 
al., 1996; Dieterich and Kilgore, 1996]. In addi- 
tion, anomalous precursory seismic activity extend- 
ing years to decades before large earthquakes and at 
distances up to ten times the mainshock rupture size 
are often thought to require different physical mecha- 
nisms [Keilis-Borok and Malinovskaya, 1964; Knopoff 
et al, 1996; Bowman et al, 1998; Jaume and Sykes, 
1999; Sammis and Sornette, 2002; Keilis-Borok, 2002] 
than for foreshocks closer to the mainshock epicenters. 

The division between foreshocks, mainshocks, and 
aftershocks has a long and distinguished history in 
seismology. Within a prc-specified space-time do- 
main, foreshocks are usually defined as earthquakes 
(above the background rate) preceding a larger earth- 
quake (mainshock), which is itself followed by an 
increase in seismicity of smaller earthquakes (after- 
shocks). However, recent empirical and theoretical 
scrutiny suggests that this division might be arbi- 
trary and physically artificial [Kagan and Knopoff, 
1981; Shaw, 1993; Jones et al, 1995; Hough and 
Jones, 1997; Felzer et al, 2002]. Since the imdcr- 
lying physical processes are not fully understood, the 
qualifying time and space windows used to select af- 
tershocks, mainshocks and aftershocks are more based 
on common sense than on hard science. If the space- 
time window is extended and a new event not con- 
sidered previously is found with a magnitude larger 
than the previously classified mainshock, it becomes 
the new mainshock and all preceding events are ret- 
rospectively called foreshocks. A clear identification 
of foreshocks, aftershocks and mainshocks is hindered 
by the fact that nothing distinguishes them in their 
seismic signatures: at the present level of resolu- 
tion of seismic inversions, they are found to have 



the same double-couple structure and the same ra- 
diation patterns [Hough and Jones, 1997]. Statisti- 
cally, the aftershock magnitudes are distributed ac- 
cording to the Gutenberg-Richter (GR) distribution 
P{m) ^ lO^''™ with a 6-value similar to other earth- 
quakes [Ranalli, 1969; Utsu et al, 1995]. How- 
ever, some studies [Knopoff et al, 1982; Molchan 
and Dmitrieva, 1990; Molchan et al., 1999] have sug- 
gested that foreshocks have a smaller 6-value than 
other earthquakes, but the physical mechanisms are 
not yet understood. 

The Omori law describes the power law decay of 
the aftershock rate l/(i — tcY with time from a 
mainshock that occurred at tc [Omori, 1894; Kagan 
and Knopoff, 1978; Utsu et al., 1995], and which may 
last from months up to decades. In contrast with the 
well-defined Omori law for aftershocks, there arc huge 
fluctuations of the foreshock seismicity rate, if any, 
from one sequence of earthquakes to another one. Fig- 
ure 1 shows foreshock scqiiences of all M > 6.5 main- 
shocks in the catalog of Southern California seismicity 
provided by the Southern California Seismic Network 
for the period 1932-2000. By stacking many foreshock 
sequences, a well-defined acceleration of the seismic- 
ity preceding mainshocks emerges, quantified by the 
so-called inverse Omori law ^ \/{tc~ t)P , where tc is 
the time of the mainshock [Papazachos, 1973; Kagan 
and Knopoff, 1978; Jones and Molnar, 1979]. While 
we see clearly an acceleration for the averaged fore- 
shock number (black line in Figure 1), there are huge 
fluctuations of the rate of foreshocks for individual 
sequences. Most foreshock sequences are character- 
ized by the occurrence of a major earthquake before 
the mainshock, which has triggered the mainshock 
most probably indirectly due to a cascade of multiple 
triggering. For instance, 66 days before its occur- 
rence, the M = 7.3 Landers earthquake (upper curve 
in Figure 1) was preceded by the M = 6.1 Joshua- 
Tree earthquake. The successive oscillations of the 
cumulative number of events after the Joshua- Tree 
earthquake correspond to secondary, tertiary, etc., 
bursts of triggered seismicity. The inverse Omori law 
is usually observed for time scales shorter than the 
direct Omori law, of the order up to weeks to a few 
months before the mainshock. However, there seems 
to be no way of identifying foreshocks from usual af- 
tershocks and mainshocks in real time (sec [Jones 
et al., 1995; Hough and Jones, 1997] for a pioneer- 
ing presentation of this view point). In other words, 
is the division between foreshocks, mainshocks and 
aftershocks falsifiable [Kagan, 1999]? 
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In order to address this question, we present a novel 

analysis of seismic catalogs, based on a parsimonious 
model of the forcshocks, mainshocks and aftershocks, 
in terms of earthquake triggering: earthquakes may 
trigger other earthquakes through a variety of physi- 
cal mechanisms [Harris, 2001] but this does not al- 
low one to put a tag on them. Thus, rather than 
keeping the specific classification that foreshocks are 
precursors of mainshocks and mainshocks trigger af- 
tershocks, we start from the following hypothesis: 

Hypothesis Foreshocks, mainshocks, and after- 
shocks are physically indistinguishable. 

In this paper, we study a simple model of seismic- 
ity based on this Hypothesis and demonstrate that it 
can explain many properties of foreshocks, such as the 
larger proportion than normal of large versus small 
foreshocks, the power law acceleration of seismicity 
rate as a function of time to the mainshock and the 
spatial migration of foreshocks toward the mainshock, 
when averaging over many sequences. These proper- 
ties and others arise naturally from the simple model 
that any earthquake may trigger other earthquakes, 
without arbitrary distinction between foreshocks, af- 
tershocks and mainshocks. 

1.1. Definition of the model 

The simplest construction that embodies the Hy- 
pothesis is the epidemic-type aftershock sequence 
(ETAS) model introduced in [Kagan and Knopoff, 
1981; Kagan and Knopoff, 1987] (in a slightly differ- 
ent form) and in [Ogata, 1988]. In this model, all 
earthquakes may be simultaneously mainshocks, af- 
tershocks and possibly forcshocks. An observed "af- 
tershock" sequence in the ETAS model is the sum of 
a cascade of events in which each event can trigger 
more events. The triggering process may be caused 
by various mechanisms that either compete or com- 
bine, such as pore-pressure changes due to pore-fluid 
flows coupled with stress variations, slow redistribu- 
tion of stress by aseismic creep, rate-and-state de- 
pendent friction within faults, coupling between the 
viscoelastic lower crust and the brittle upper crust, 
stress-assisted micro-crack corrosion, etc.. 

The epidemic-type aftershock (ETAS) model as- 
sumes that a given event (the "mother" ) of magnitude 
rui occurring at time ti and position fi gives birth to 
other events ( "daughters" ) of any possible magnitude 
m at a later time between t and t + dt and at point 
f±dr at the rate 

(l>m,{t-ti,f-ri)=p{mi) -^{t-ti) ^f-fi) . (1) 



We will refer to 4'mi{t — ti,f — fi) as the "local"' 
Omori law, giving the seismic rate induced by a sin- 
gle mother. It is the product of three independent 
contributions: 

1. p{mi) gives the number of daughters born from 
a mother with magnitude m^. This term is in 
general chosen to account for the fact that large 
earthquakes have many more triggered events 
than small earthquakes. Specifically, 

p{mi) = K 10"™* , (2) 

which is in good agreement with the observa- 
tions of aftershocks productivity (see Helmstet- 
ter [2003] and references therein). 

2. ^'(t — ij) is a normalized waiting time distribu- 
tion giving the rate of daughters born at time 
t — ti after the mother 



The "local" Omori law ^{t) is generally signifi- 
cantly different from the observed Omori law of 
aftershocks. We have shown in [Helmstetter and 
Sornette, 2002a] that the exponent p of the ob- 
served "global" Omori law is equal to or smaller 
than the exponent 1-1-0 of the "local" Omori law 
^'(t) due to the cascades of triggering. 

3. $(r — fi) is a normalized spatial "jump" distri- 
bution from the mother to each of her daughter, 
quantifying the probability for a daughter to be 
triggered at a distance Jr — fi) from the mother. 
Specifically, we take 

$(f) = ^ , (4) 

where // is a positive exponent and d is a char- 
acteristic dimension of the earthquake source, 
as suggested by [Ogata, 1999]. 

The last ingredient of the ETAS model is that 
the magnitude m of each daughter is chosen inde- 
pendently from that of the mother and of all other 
daughters according to the Gutenberg-Richter distri- 
bution 

P(m) = b In(lO) iQ-Hm-mo) ^ 

with a 6- value usually close to 1. mg is a lower bound 
magnitude below which no daughter is triggered. 
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The total seismicity rate is the sum of all after- 
shocks of past events and of a constant external source 
s used to model the constant tectonic loading. The 
full distribution of seismicity in time, space and mag- 
nitude is thus described by the 8 parameters of the 
ETAS model K, c, 9, fx, d, a, b and s. See [Ogata, 
1999], [Helmstetter and Sornette, 2002a] and refer- 
ences therein for a discussion of the values of these 
parameters in real seismicity. 

The ETAS model has been used previously to 
give short-term probabilistic forecast of seismic ac- 
tivity [Kagan and Knopoff, 1987; Kagan and Jack- 
son, 2000; Console and Murru, 2001], and to describe 
the temporal and spatial clustering of seismic activ- 
ity [Ogata, 1988; Kagan, 1991; Kagan and Jackson, 
2000; Console and Murru, 2001; Felzer et al, 2002]. 
The Hypothesis and the ETAS model allow us to 
study two classes of foreshocks, called of type I and 
of type II defined in the next section. 

1.2. Definition of foreshocks of type I and of 
type II 

1.2.1. Formal definitions. The usual defini- 
tion of foreshocks, that we shall call "foreshock of type 
I," refers to any event of magnitude smaller than or 
equal to the magnitude of the following event, then 
identified as a "mainshock." This definition implies 
the choice of a space-time window R x T used to de- 
fine both foreshocks and mainshocks. Mainshocks are 
large earthquakes that were not preceded by a larger 
event in this space-time window. 

In contrast, the Hypothesis that the same physical 
mechanisms describe the triggering of a large earth- 
quake by a smaller earthquake (mainshock triggered 
by a type I foreshock) and the triggering of a small 
earthquake by a larger event (aftershock triggered by 
a mainshock) makes it natural to remove the con- 
straint that foreshocks must be smaller than the main- 
shock. We thus define "foreshock of type H" as any 
earthquake preceding a large earthquake which is de- 
fined as the mainshock, independently of the relative 
magnitude of the foreshock compared to that of the 
mainshock. That is, foreshocks of type II may actu- 
ally be larger than their mainshock. 

The advantage of this second definition is that fore- 
shocks of type II are automatically defined as soon 
as one has identified the mainshocks, for instance, 
by calling mainshocks all events of magnitudes larger 
than some threshold of interest. Foreshocks of type 
II are thus all events preceding these large magnitude 
mainshocks in a space-time window R x T. In con- 



trast, foreshocks of type I need to obey a constraint 
on their magnitude, which may be artificial, as we 
shall see further down. 

1.2.2. Practical implementation. In our anal- 
ysis of the SCEC catalog, we construct foreshock 
and aftershock sequences as follows. A mainshock 
is defined as an earthquake in the magnitude range 
(M, M + AM) that was not preceded by a larger event 
in a space-time window {R2, T) before the mainshock. 
The distance R2 = 50 km is here chosen to be close 
to (but smaller than) the maximum size of the spa- 
tial clusters of seismicity in the California catalog, in 
order to minimize the influence of large earthquakes 
that occurred before the mainshock. Other choices 
between 20 km to 200 km have been tested and give 
essentially the same results. The aftershocks are all 
events that occurred in a space-time window (R, T) 
after each mainshock. The foreshocks of type I are 
selected in a space-time window R, T before each 
mainshock. Foreshocks of type II are selected in the 
same space-time window {R, T) before each main- 
shock, now defined without the constraint that they 
were not preceded by a larger event in a space-time 
window (i?2, T). The only difference between fore- 
shocks of type I and of type II is that the selection of 
their respective mainshocks is different. 

The minimum magnitude cutoff used for after- 
shocks and foreshocks is Mq = 3. The fact that the 
SCEC catalog is not complete below M = 3.5 for the 
first part of the catalog 1932-1975 does not affect the 
results on the inverse and direct Omori laws, it simply 
under-estimates the mimbcr of foreshocks and after- 
shocks but does not change the temporal evolution 
of the rate of seismicity before and after mainshocks. 
The incompleteness of the catalog for M < 3.5 before 
1975 explains the roll-off of the magnitude distribu- 
tion for small magnitudes. Even for the most recent 
part, the SCEC catalog is not complete above M — 3 
in the few hours or days after a large M > 6 earth- 
quake, due to the saturation of the seismic network. 
These missing events may induce a spurious roll-off 
of both the direct and inverse Omori laws at short 
times before or after a mainshock, but cannot induce 
a spurious acceleration of the rate of seismicity before 
mainshocks. 

We stack all foreshocks and aftershocks sequences 
synchronized at the time of the "mainshocks" in dif- 
ferent mainshock magnitude intervals [M,M + AM], 
and for different choices of the space-time window R, 
T used to define foreshocks and aftershocks. We use 
larger magnitude intervals AM for larger mainshock 
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magnitudes M to compensate for the smaller number 

of large mainshocks. R has been tested between 10 km 
and up to 500 km with no essential change, except for 
an increasing sensitivity to the background seismicity 
for the largest R (see Figure 3). T has been tested be- 
tween 0.5 year to 10 years with similar results. Tests 
have also been performed with the spatial window size 
R adjusted to scale with the mainshock magnitude 
with no significant difference. Our results presented 
below thus appear robust with respect to the (arbi- 
trary) definitions of the space-time windows and the 
definition of mainshocks. In the main text, foreshocks 
of type I and of type II are treated separately. Pre- 
vious studies of foreshocks using a stacking method 
[Papazachos, 1975a; Papazachos, 1975b; Jones and 
Molnar, 1976; Kagan and Knopoff, 1978; Jones and 
Molnar, 1979; Shaw, 1993; Reasenberg, 1999] have 
considered foreshocks of type I only. 

1.3. Foreshock properties derived from the 
ETAS model 

The simple embodiment of the Hypothesis in the 
ETAS model leads to the following consequences and 
predictions [Helmstetter et al., 2003], which are pro- 
posed as crucial tests of the Hypothesis. 

1. The rate of foreshocks of type II is predicted 
to increase before the mainshock according to 
the inverse Omori law N{t) ^ l/{tc — ty with 
an exponent p' smaller than the exponent p of 
the direct Omori law. The exponent p' depends 
on the "local" Omori exponent \ + 9 describing 
the direct triggering rate between earthquakes 
(first-generation triggering), on the 6- value of 
the GR distribution and on the exponent a 
quantifying the increase oc 10"*^ in the number 
of aftershocks as a function of the magnitude 
M of the mainshock [Helmstetter, 2003]. The 
inverse Omori law also holds for foreshocks of 
type I preceding large mainshocks. The inverse 
Omori law results from the direct Omori law 
for aftershock and from the multiple cascades 
of triggering. The inverse Omori law emerges 
as the expected (in a statistical sense) trajec- 
tory of seismicity, conditioned on the fact that 
it leads to the burst of seismic activity accom- 
panying the mainshock. 

2. In contrast with the direct Omori law, which 
is clearly observed after all large earthquakes, 
the inverse Omori law is observed only when 
stacking many foreshock sequences. Even for 



small mainshocks, for which the number of af- 
tershocks is similar to the number of foreshocks, 
there are much larger fluctuations in the rate of 
foreshocks before mainshocks than in the rate 
of aftershocks. 

3. While the number of aftershocks increases as 
10"^ with the magnitude M of the mainshock, 
the number of foreshocks of type II is predicted 
to be independent of M. Thus, the seismicity 
should increase on average according to the in- 
verse Omori law before any earthquake, what- 
ever its magnitude. For foreshocks of type I, 
the same results should hold for large main- 
shocks. For small and intermediate values of 
the mainshock magnitude AJ , the conditioning 
on foreshocks of type I to be smaller than their 
mainshock makes their number increase with M 
solely due to the constraining effect of their def- 
inition. 

4. The GR distribution for foreshocks is predicted 
to change upon the approach of the mainshock, 
by developing a bump in its tail. Specifically, 
the modification of the GR law is predicted to 
take the shape of an additive correction to the 
standard power law, in which the new term is 
another power law with exponent b — a. The 
amplitude of this additive power law term is 
predicted to also exhibit a power law acceler- 
ation upon the approach to the mainshock. 

5. The spatial distribution of foreshocks is pre- 
dicted to migrate toward the mainshock as the 
time increases toward the time of the main- 
shock, by the mechanism of a cascade of seismic 
triggering leading to a succession of jumps like 
in a continuous-time random walk [Helmstetter 
and Sornette, 2002b]. 

We now proceed to test systematically these pre- 
dictions on the catalog of the Southern California 
Data Center (SCEC) over the period 1932-2000, which 
is almost complete above M = 3 and contains more 
than 22000 M > 3 earthquakes, using the methodol- 
ogy described in section 1.2. 
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2. Properties of foreshocks, 

mainshocks and aftershocks and 
comparison with the ETAS model 

2.1. Observation of the direct and inverse 
Omori law 

Figure 2 shows the rate of foreshocks of type II as 
a function of tc — t and of aftershocks as a function 
oi t — tc, where tc is the time of the mainshocks, for 
different mainshock magnitude between 3 and 7. Af- 
tershocks and foreshocks have been selected with the 
space-time window T = 1 yr and i? = 50 km. Both 
rates follow an approximate power law (inverse Omori 
law for foreshocks with exponent p' and Omori law for 
aftershocks with exponent p). 

The fluctuations of the rate of foreshocks are larger 
for large mainshocks, because the number of main- 
shocks (resp. foreshocks) decreases from 15584 (resp. 
1656249) for the magnitude range 3 — 3.5 down to 
47 (resp. 1899) for M > 6 mainshocks. In con- 
trast, the fluctuation of the rate of aftershocks are 
larger for small mainshocks magnitudes, due to the 
increase of the number of aftershocks per mainshock 
with the mainshock magnitude, and to the rules of 
mainshock selection which reject a large proportion 
of small earthquakes. The number of mainshocks is 
different for aftershocks and foreshocks due to their 
distinct definition. The number of mainshocks associ- 
ated with aftershocks decreases from 677 for the mag- 
nitude range 3—3.5 down to 39 for M > 6 mainshocks. 
The number of aftershocks increases from 2614 for 
3 < M < 3.5 up to 7797 for M > 6 mainshocks. The 
fluctuations of the data makes it hard to exclude the 
hypothesis that the two power laws have the same ex- 
ponent p = p' = 1., even if p' seems slightly smaller. 
Note that it can be shown theoretically for a < b/2 
that p = 1 — and p' = 1 — 29 for a local Omori 
law with exponent 1 + 6 and that the difference p — p' 
should get smaller as a increases above b/2 [Helm- 
stetter et ai, 2003]. Since a ^ 0.8 > b/2 0.5, this 
limit is met which explains the smallness of the dif- 
ference p — p' . The truncation of the seismicity rate 
for small times \tc — t\ < 1 day, especially for after- 
shocks of large M > 6 mainshocks and for foreshocks, 
is due to the incompleteness of the catalog at very 
short times after mainshocks due to the saturation of 
the seismic network. At large times from the main- 
shock, the seismicity rate decreases to the level of the 
background seismicity, as seen clearly for the rate of 
aftershocks following small M = 3 mainshocks. 

The second striking observation is the strong vari- 



ation of the amplitude of the rate Na{t) of aftershocks 
as a function of the magnitude M of the mainshock, 
which is well-captured by an exponential dependence 
Na{t) oc 10«^/(t - QP with a ^ 0.8 [Helmstetter, 
2003]. In contrast, the rates of foreshocks of type 
II are completely independent of the magnitude M 
of the mainshocks: quite strikingly, all mainshocks 
independently of their magnitudes are preceded by 
the same statistical inverse Omori law, with the same 
power law increase and the same absolute amplitude! 
All these results are very well modeled by the ETAS 
model with the parameters a = 0.8, 9 = 0.2 and 
b = I using the theoretical framework and numerical 
simulations developed in [Helmstetter et al, 2003]. 

Another remarkable observation is presented in 
Figure 3 which shows the rate of foreshocks of type 
II for mainshock magnitudes between 4 and 4.5, for 
different values of the distance R used to select af- 
tershocks and foreshocks. The inverse Omori law is 
observed up to i? sa 200 km, and the duration of the 
foreshock sequences increases as R decreases due to 
the decrease of the effect of the background seismic- 
ity. Restricting to the shortest distances R to min- 
imize the impact of background seismicity, the in- 
verse Omori laws can be observed up to 10 yrs be- 
fore mainshocks, for foreshocks of type II (Figure 
4). Thus, foreshocks are not immediate precursors 
of mainshocks but result from physical mechanisms 
of earthquake triggering acting over very long times 
and large distances. 

An important question concerns the relative weight 
of coincidental shocks, i.e., early aftershocks triggered 
by a previous large earthquake, which appear as fore- 
shocks of type II to subsequent aftershocks (seen as 
mainshocks of these foreshocks of type II). Such co- 
incidental shocks can give rise to an apparent inverse 
Omori law [Shaw, 1993] when averaging over all pos- 
sible positions of "mainshocks" in the sequence, with- 
out any direct interaction between these mainshocks 
and preceding events viewed as their foreshocks. Ac- 
tually, these coincidental shocks form a minority of 
the total set, because the fraction of shocks directly 
triggered by a mainshock decays to negligible values 
beyond a few days for the range of parameters of the 
ETAS that realistically fit the SCEC catalog [Felzer 
et al, 2002]. 

Figure 5 shows the rate of foreshocks of type I as 
a function oi tc — t and of aftershocks as a function 
of t — tc, where tc is the time of the mainshocks. The 
total number of foreshocks of type I is much smaller 
that the number of type II foreshocks for small main- 
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shocks because a significant fraction of foreshocks of 
type II are "aftershocks" of large M > 6 earthquakes 
according to the usual definition and are therefore re- 
jected from the analysis of foreshocks of type I, which 
are constrained to be smaller than their mainshock. 
There arc much larger fluctuations for foreshocks of 
type I than for foreshocks of type II due to the smaller 
number of the former. Nevertheless, type I and type 
II foreshocks clearly follow the same trajectory of in- 
creased activity before a mainshock, suggesting that 
type I forc^shocks, like type II foreshocks, are triggers 
of the mainshock. 

The exponent p' of the inverse Omori law for fore- 
shocks of type I is approximately equal to the expo- 
nent of foreshocks of type II and to the exponent p of 
the direct Omori law for aftershocks. The rate of fore- 
shocks of type I increases slowly with the mainshock 
magnitude but this increase is not due to a larger 
predictability of larger earthquakes, as expected for 
example in the critical point theory [Sammis and 
Sornette, 2002] and as observed in a numerical model 
of seismicity [Huang et al., 1998]. The increase of the 
number of type I foreshocks with the mainshock mag- 
nitude can be reproduced faithfully in synthetic cat- 
alogs generated with the ETAS model and is nothing 
but the consequence of the algorithmic rules used to 
define foreshocks of type I. Namely, by definition, big- 
ger mainshocks are allowed to have bigger type I fore- 
shocks and therefore they have more type I foreshocks 
than smaller mainshocks. In other words, there is 
no physics but only statistics in the weak increase 
of foreshocks of type I with the mainshock magni- 
tude. Confirming this concept, the inverse Omori 
law for foreshocks of type I becomes independent of 
the mainshock magnitudes M for large M, for which 
the selection constraint has only a weak effect. The 
dependence of the inverse Omori law for foreshocks 
of type I as a function of distance R used to select 
foreshocks is very similar (not shown) to that shown 
for foreshocks of type II in Figure 3. However, the 
duration of foreshock sequences is shorter for type I 
foreshocks because the number of type I foreshocks is 
smaller than the number of type II foreshocks due to 
the rules of foreshock selection. 

2.2. Modification of the magnitude 
distribution before a mainshock 

We have shown in [Helmstetter et ai, 2003] that 
selecting and stacking foreshock sequences in the 
ETAS model leads to a modification of the magni- 
tude distribution compared to the theoretical distri- 



bution. We refer to [Helmstetter et al, 2003] for the 
derivation of the results stated below, which is too in- 
volved to be reported here. The distribution P{m) of 
foreshock magnitudes is predicted to get an additive 
power law contribution q{t)dP{m) with an exponent 
b' smaller than h and with an amplitude q{t) growing 
as a power law of the time to the mainshock: 

P(m) = (1 - q{t))Po{m) + q(t) dP{m) , (6) 

where Po{m) is the standard GR distribution Po{m) ~ 
10-^™ and dP{m) ~ 10"^' with b' = b - a. The 
amplitude q(t) of the additive distribution dP(m) in 
(6) should increase as a power-law of the time to the 
mainshock according to 

qit)^l/it,~tf^ . (7) 

This analytical prediction has been checked with ex- 
tensive numerical simulations of the ETAS model. 
This change of the magnitude distribution for fore- 
shocks docs not mean that foreshocks belong to a dif- 
ferent population, but simply results from the defi- 
nition of foreshocks, which are only defined as fore- 
shocks after the mainshock occmred. The mechanism 
giving rise to the change of distribution for foreshocks 
is explained in [Helmstetter et ai, 2003]. Intuitively, 
the modification of the magnitude distribution of fore- 
shocks results from the increase of the number of trig- 
gered events with the mainshock magnitude. There 
are few large earthquakes, but they trigger many more 
earthquakes than smaller earthquakes. As a conse- 
quence, a large fraction of mainshocks are triggered, 
directly or indirectly, by large foreshocks. The pro- 
portion of large foreshocks is thus larger than the pro- 
portion of large earthquakes in the whole population, 
and gives an apparent lower 6-value for foreshocks 
than for other earthquakes. The magnitude distri- 
bution of triggering events is given by the product 
p{m)Po{m) ~ io»rniQ-bm ^ dP{m), which gives the 
distribution of foreshock magnitudes for large m. 

We now test this prediction using the SCEC cat- 
alog on foreshocks of type II of M > 3 mainshocks, 
selected using R = 20 km and T = 1 yr. The mag- 
nitude distribution P(m) sampled at different times 
before mainshocks is shown in panel (a) of Figure 6, 
where the black to gray curves correspond to times 
preceding mainshocks decreasing from 1 year to 0.01 
day with a logarithmic binning. As time approaches 
that of the mainshocks, one can clearly observe that 
the tails depart more and more from the standard 
GR power law Po{m) with b = 1.0 ± 0.1 estimated 
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using the whole catalog and shown as the dashed 
line. The perturbation q(t)dP{m) in (6) of the fore- 
shock magnitude distribution, shown in panel b) of 
Figure 6, can be estimated by fitting the prcdic;tion 
(6) to the observed magnitude distribution of fore- 
shocks, by inverting the parameters b' and q{t) in (6) 
for different times before mainshocks. The obtained 
additive GR laws q{t)dP{m) are compatible with pure 
power laws with an approximately constant exponent 
b' = 0.6 ± 0.1 shown in panel (d) of Figure 6, ex- 
cept at very long times before the mainshock where 
it drops to when the amplitude q{t) of the perturba- 
tion becomes too small. The amplitude q{t) is shown 
in panel (c) and is compatible with a power law (7) 
with a fitted exponent 0.3 ± 0.2. These observations 
are in good qualitative agreement with the predic- 
tions (6) and (7) on the nature of the modification of 
the GR law for foreshocks in terms of a pure additive 
power law perturbation with an amplitude growing 
as a power law of the time to the mainshocks. Quan- 
titatively, b' is marginally outside the 2(T-confidence 
interval for the prediction b' = b — a = 0.2 ± 0.2 us- 
ing the estimation a = 0.8 ± 0.1 given by [Helmstet- 
ter, 2003]. We attribute this discrepancy to the dual 
impact of the incompleteness of the catalog for small 
magnitudes after a large earthquake and to the small- 
ness of the statistics. We stress that the prediction 
(6) with b' = b — a has been verified with good pre- 
cision in synthetic catalogs which do not have these 
limitations [Helmstetter et al., 2003]. Using the best 
fitted value b' = 0.6, we obtain a reasonable agree- 
ment for the predicted exponent 9b' /a and the fitted 
value 0.3±0.2 for the power law behavior of q{t) using 
6 in the range 0.2 - 0.4. 

Note that expression (6) contains as a special 
case the model in which the modification of the 
GR law occurs solely by a progressive decrease of 
the 6-value as the time of the mainshock is ap- 
proached (by putting q{t) = 1 and allowing b' to 
adjust itself as a function of time), as proposed in 
[Berg, 1968; Kagan and Knopoff, 1978; Molchan and 
Dmitrieva, 1990; Molchan et al, 1999]. Our quanti- 
tative analysis clearly excludes this possibility while 
being completely consistent with the mechanism em- 
bodied by the concept of triggered seismicity. Al- 
though the foreshock magnitude distribution is not a 
pure power-law but rather the sum of two power laws, 
our results rationalize the reported decrease of 6- value 
before mainshocks [Berg, 1968; Kagan and Knopoff, 
1978; Molchan and Dmitrieva, 1990; Molchan et al, 
1999]. Indeed, with a hmited number of events, the 



sum of two power laws predicted by (6) with an in- 
creasing weight of the additiv(^ law dP{m) as the time 
of the mainshock is approached will be seen as a de- 
creasing 6-value when fitted with a single GR power 
law. 

2.3. Migration of foreshocks 

The last prediction discussed here resulting from 
the Hypothesis is that foreshocks should migrate 
slowly toward the mainshock. Note that the speci- 
fication (1) of the ETAS model defined in section 1.1 
predicts no difi^usion or migration if seismicity results 
solely from direct triggering (first generation from 
mother to daughter). Technically, this results from 
the separability of the space and time dependence of 
4>Mi {t — ti,f — ri). In the ETAS model, diffusion and 
migration can be shown to result from the cascade of 
secondary, tertiary (and so on) triggered seismicity, 
akin to a (continuous-time) random walk with mul- 
tiple steps [Helmstetter and Sornette, 2002b], which 
couples the space and time dependence of the result- 
ing global seismicity rate. This migration or anti- 
diffusion of the seismic activity toward the mainshock 
is quantified by the characteristic size R of the cluster 
of foreshocks which is predicted to decrease before the 
mainshock according to [Helmstetter and Sornette, 
2002b; Helmstetter et al., 2003] 

~ (te - tf , (8) 

with H = 9/fj, for /i < 2 where 9 and fi are defined 
in section 1.1. It is natural that the (sub-)diffusive 
exponent H combines the exponent 9 (respectively n) 
of the time- (resp. space-) dependent local processes 
(3) and (4). This law (8) describes the localization of 
the seismicity as the mainshock approaches, which is 
also observed in real seismicity [Kagan and Knopoff, 
1978; von Seggern et al, 1981]. 

We use a superposed epoch analysis and stack all 
sequences of foreshocks of type II synchronized at the 
time of the mainshock and with a common origin of 
space at the location of each mainshock. The anal- 
ysis of the California seismicity presented in inset of 
Figure 4 shows clearly a migration of the seismicity 
toward the mainshock, confirmed by the significant 
diffusion exponent H = 0.3 ± 0.1. This value is com- 
patible with the estimates 9 = 0.2 and ^ = 1. We 
obtain the same pattern for type I and type II fore- 
shocks, but the figures for type I foreshocks have more 
noise due to the smaller number of events. However, 
this migration is likely to be an artifact of the back- 
ground activity, which dominates the catalog at long 



9 



times and distances from the mainshocks. Indeed, the 
shift in time from the dominance of the background 
activity at large times before the mainshock to that of 
the foreshock activity clustered around the mainshock 
at times just before it may be taken as an apparent in- 
verse diffusion of the seismicity rate when using stan- 
dard quantifiers of diffusion processes (see [Helmstet- 
ter and Sornette, 2002b] for a discussion of a similar 
effect for the apparent diffusion of aftershocks). 

3. Discussions and conclusions 

By defining the forcshocks of type II and by com- 
paring them with standard foreshocks of type I, we 
have revisited the phenomenology of earthquake fore- 
shocks using the point of view of triggered seismicity 
formulated in our Hypothesis. We have found that 
the most salient properties of foreshock sequences are 
explained solely by the mechanism of earthquake trig- 
gering. This validates the Hypothesis. 

An important result is that the precursory modi- 
fication of the seismic activity before a mainshock is 
independent of its magnitude, as expected by the trig- 
gering model with a constant magnitude distribution. 
Therefore, large earthquakes are not more predictable 
than smaller earthquakes on the basis of the power- 
law acceleration of the seismicity before a mainshock 
or by using the modification of the magnitude distri- 
bution. 

All these results taken together stress the impor- 
tance of the multiple cascades of earthquake trigger- 
ing in order to make sense of the complex spatio- 
temporal seismicity. In particular, our results do not 
use any of the specific physical mechanisms proposed 
earlier to accoimt for some of the observations an- 
alyzed here. For instance, the ETAS model is dif- 
ferent from the receding stress shadow model [Bow- 
man and King, 2001; Sam,m,is and Sornette, 2002], 
from the critical earthquake model [Bowman et al., 
1998; Jaume and Sykes, 1999; Sammis and Sornette, 
2002] and from the pre-slip model [Dodge et al., 1996], 
which in addition each addresses only a specific part 
of the seismic phenomenology. Our demonstration 
and/or confirmations of (i) the increase of rate of fore- 
shocks before mainshocks (ii) at large distances and 
(iii) up to decades before mainshocks, (iv) a change of 
the Gutenberg-Richtcr law from a concave to a con- 
vex shape for foreshocks, and (v) the migration of 
foreshocks toward mainshocks are reminiscent of, if 
not identical to, the precursory patterns documented 
in particular by the Russian [Keilis-Borok and Ma- 



linovskaya, 1964; Keilis-Borok, 2002] and Japanese 
[Mogi, 1995] schools, whose physical origin has re- 
mained elusive an/or controversial. The present work 
suggests that triggered seismicity is sufficient to ex- 
plain them. The concepts and techniques and their 
variations developed here could be applied to a variety 
of problems, such as to determine the origin of finan- 
cial crashes [Sornette et at, 2003; Johansen and, Sor- 
nette, 2003], of major biological extinctions, of change 
of weather regimes and of the climate, and in tracing 
the somcc of social upheaval and wars [Sornette and 
Helmstetter, 2003]. 

The cascade model described here is sufficient to 

explain the properties of foreshocks in time, space 
and magnitude. There may be however other prop- 
erties of foreshocks not explained by this model, such 
has an unusual waveform or focal mechanism, that 
may provide ways to distinguish foreshocks from other 
earthquakes in real time [Dodge et al., 1996; Kilb and 
Gomberg, 1999]. For instance. Dodge et al. [1996] 
found that some foreshock sequences in California 
were inconsistent with static stress triggering of the 
mainshock, and concluded that these foreshocks were 
more likely a by-product of an aseismic nucleation 
process. Several points may reconcile these observa- 
tions with the cascade model. First, there are other 
mechanisms such as dynamical effects not taken into 
account in this study that may explain earthquake 
triggering. Indeed, a large fraction of aftershocks are 
inconsistent with static triggering by the mainshock. 
Second, some foreshocks may be missing in the cata- 
log used and may have triggered the mainshock. Re- 
call that, for a <b, the mainshock is more likely to be 
triggered by a small, possibly undetected, earthquake 
[Helmstetter, 2003]. 
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Figure 1. Cumulative number of foreslioclcs of type II (see definition in section 1.2) for all M > 6.5 mainshocks 
(thin gray lines), and average number of foreshocks per mainshock (heavy black line), obtained by stacking all 
(3700) foreshock sequences of M > 4 mainshocks in the SCEC catalog. The foreshocks have been selected in a 
time-space window with T — 200 days and i? = 30 km. 
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Figure 2. Rate of seismic activity per mainshock for forcshocks of type II (continuous lines) defined in section 
1.2 and for aftershocks (dashed lines) measured as a function of the time \t — tc\ from the mainshock occurring 
at tc, obtained by stacking many earthquake sequences for different mainshock magnitude intervals given in the 
legend. This figure illustrates the power-law increase of seismicity before the mainshock (inverse Omori law) and 
the power-law decrease after the mainshock (direct Omori law) , and show how the rate of forcshocks and aftershocks 
changes with mainshock magnitude. 
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Figure 3. Rate of foreshocks of type II (defined in section 1.2) averaged over 2158 mainshock with magnitudes in 
the range (4,4.5), for T — 10 yrs and for different choices of the distance R between 1 and 200 km used to select 
foreshocks around mainshocks. The total number of foreshocks of type II increases from 1001 for R ~ 1 km up to 
2280165 for R = 200 km. 
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Figure 4. Rate of foreshocks of type II (defined in section 1.2) before M > 4.5 mainshocks as a function of the 
distance from the mainshock for different values of the time before the mainshock ranging from 0.01 day (black 
line at the top) to 10 yrs (gray line at the bottom). We use logarithmic bins for the time windows, with a bin size 
increasing from 0.01 day up to 10 yrs as a geometric series with multiplicative factor 3.2. The number of events in 
each time window increases from N — 936 for 0.01 — 0.03 days up to 2096633 for 1000 — 3650 days. We evaluate the 
seismicity rate for different distances from the mainshock by counting the number of events in each shell (r, r + Ar). 
The seismicity rate is normalized by the number of mainshocks, the duration of the time window and the widths 
of the space window Ar (controlling the discretization of the curves) used to estimate the seismicity rate. The 
inset shows the characteristic size of the cluster of foreshocks, measured by the median of the distance between all 
foreshock-mainshock pairs, as a function of the time before the mainshock. The solid line is a fit by a power-law 
R with H — 0.3. Due to the large space-time window T = 10 yrs and R = 500 km used to select foreshocks, a 
large proportion of the seismicity are background events, which induces a spurious migration of seismicity toward 
the mainshock (see section 2.3). 
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Figure 5. Same as Figure 2 for foreshocks of type I (defined in section 1.2) which have been selected using a 
space-time window R — R2 = 50 km and T = 1 yr. The presented data and the statistics for aftershocks are 
the same as in Figure 2. The total number of foreshocks of type I ranges from 1050 to 5462 depending on the 
mainshock magnitude. The same mainshocks are used for the selection of aftershocks and of type I foreshocks. 
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Figure 6. (a) Magnitude distribution P(m) of foreshocks of type II using a space window R = 20 km, for different 
time windows before the mainshock, ranging from black to gray as the time tc — t from the mainshock decreases 
from 1 yr to 0.01 day. Note the progressive increase in the proportion of large earthquakes by comparison to 
the normal distribution (dashed line, 6=1). Each curve contains the same number of events, (b) difference 
q{t) dP(m) between the foreshock magnitude. The foreshock magnitude distribution is well fitted in the magnitude 
range 4 < m < 7 by the sum of two power-laws (6), with an exponent b' « 0.5 independently of the time from the 
mainshock. The amplitude q(t) of the perturbation is shown in panel (c) and the exponent b' of dP{m) is shown 
in panel (d). q{t) is fitted with a power law fit as a function oitc — t with exponent 0.3 ± 0.2 shown as the dashed 
line in panel (c). 



